AITopics

arXiv.org Artificial IntelligenceApr-12-2024

Explaining the Machine Learning Solution of the Ising Model

Alamino, Roberto C.

As powerful as machine learning (ML) techniques are in solving problems involving data with large dimensionality, explaining the results from the fitted parameters remains a challenging task of utmost importance, especially in physics applications. This work shows how this can be accomplished for the ferromagnetic Ising model, the main target of several ML studies in statistical physics. Here it is demonstrated that the successful unsupervised identification of the phases and order parameter by principal component analysis, a common method in those studies, detects that the magnetization per spin has its greatest variation with the temperature, the actual control parameter of the phase transition. Then, by using a neural network (NN) without hidden layers (the simplest possible) and informed by the symmetry of the Hamiltonian, an explanation is provided for the strategy used in finding the supervised learning solution for the critical temperature of the model's continuous phase transition. This allows the prediction of the minimal extension of the NN to solve the problem when the symmetry is not known, which becomes also explainable. These results pave the way to a physics-informed explainable generalized framework, enabling the extraction of physical laws and principles from the parameters of the models.

configuration, critical temperature, magnetization, (16 more...)

2402.11701

Country:

North America > United States > New York (0.04)
North America > United States > District of Columbia > Washington (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Cimpeanu, Theodor, Pereira, Luis Moniz, Han, The Anh

Co-evolution of Social and Non-Social Guilt

arXiv.org Artificial IntelligenceFeb-20-2023

Building ethical machines may involve bestowing upon them the emotional capacity to self-evaluate and repent on their actions. While reparative measures, such as apologies, are often considered as possible strategic interactions, the explicit evolution of the emotion of guilt as a behavioural phenotype is not yet well understood. Here, we study the co-evolution of social and non-social guilt of homogeneous or heterogeneous populations, including well-mixed, lattice and scale-free networks. Socially aware guilt comes at a cost, as it requires agents to make demanding efforts to observe and understand the internal state and behaviour of others, while non-social guilt only requires the awareness of the agents' own state and hence incurs no social cost. Those choosing to be non-social are however more sensitive to exploitation by other agents due to their social unawareness. Resorting to methods from evolutionary game theory, we study analytically, and through extensive numerical and agent-based simulations, whether and how such social and non-social guilt can evolve and deploy, depending on the underlying structure of the populations, or systems, of agents. The results show that, in both lattice and scale-free networks, emotional guilt prone strategies are dominant for a larger range of the guilt and social costs incurred, compared to the well-mixed population setting, leading therefore to significantly higher levels of cooperation for a wider range of the costs. In structured population settings, both social and non-social guilt can evolve and deploy through clustering with emotional prone strategies, allowing them to be protected from exploiters, especially in case of non-social (less costly) strategies. Overall, our findings provide important insights into the design and engineering of self-organised and distributed cooperative multi-agent systems.

agent, artificial intelligence, emotion, (16 more...)

2302.09859

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > Scotland > Fife > St. Andrews (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Leisure & Entertainment > Games (0.34)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.66)

arXiv.org Artificial IntelligenceOct-29-2022

Neural network quantum state with proximal optimization: a ground-state searching scheme based on variational Monte Carlo

Chen, Feng, Xue, Ming

Neural network quantum states (NQS), incorporating with variational Monte Carlo (VMC) method, are shown to be a promising way to investigate quantum many-body physics. Whereas vanilla VMC methods perform one gradient update per sample, we introduce a novel objective function with proximal optimization (PO) that enables multiple updates via reusing the mismatched samples. Our VMC-PO method keeps the advantage of the previous importance sampling gradient optimization algorithm [L. Yang, {\it et al}, Phys. Rev. Research {\bf 2}, 012039(R)(2020)] that efficiently uses sampled states. PO mitigates the numerical instabilities during network updates, which is similar to stochastic reconfiguration (SR) methods, but achieves an alternative and simpler implement with lower computational complexity. We investigate the performance of our VMC-PO algorithm for ground-state searching with a 1-dimensional transverse-field Ising model and 2-dimensional Heisenberg antiferromagnet on a square lattice, and demonstrate that the reached ground-state energies are comparable to state-of-the-art results.

artificial intelligence, deep learning, machine learning, (19 more...)

2210.16493

Country:

Asia > China > Jiangsu Province > Nanjing (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > France (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.48)

Hibat-Allah, Mohamed, Melko, Roger G., Carrasquilla, Juan

Supplementing Recurrent Neural Network Wave Functions with Symmetry and Annealing to Improve Accuracy

arXiv.org Artificial IntelligenceJul-28-2022

Recurrent neural networks (RNNs) are a class of neural networks that have emerged from the paradigm of artificial intelligence and has enabled lots of interesting advances in the field of natural language processing. Interestingly, these architectures were shown to be powerful ansatze to approximate the ground state of quantum systems. Here, we build over the results of [Phys. Rev. Research 2, 023358 (2020)] and construct a more powerful RNN wave function ansatz in two dimensions. We use symmetry and annealing to obtain accurate estimates of ground state energies of the two-dimensional (2D) Heisenberg model, on the square lattice and on the triangular lattice. We show that our method is superior to Density Matrix Renormalisation Group (DMRG) for system sizes larger than or equal to $14 \times 14$ on the triangular lattice.

artificial intelligence, lattice, machine learning, (18 more...)

2207.14314

Country:

North America > Canada > Ontario > Waterloo Region > Waterloo (0.14)
North America > Canada > Ontario > Toronto (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Kim, Dongkyu, Kim, Dong-Hee

Emergence of a finite-size-scaling function in the supervised learning of the Ising phase transition

arXiv.org Machine LearningOct-1-2020

We investigate the connection between the supervised learning of the binary phase classification in the ferromagnetic Ising model and the standard finite-size-scaling theory of the second-order phase transition. Proposing a minimal one-free-parameter neural network model, we analytically formulate the supervised learning problem for the canonical ensemble being used as a training data set. We show that just one free parameter is capable enough to describe the data-driven emergence of the universal finite-size-scaling function in the network output that is observed in a large neural network, theoretically validating its critical point prediction for unseen test data from different underlying lattices yet in the same universality class of the Ising criticality. We also numerically demonstrate the interpretation with the proposed one-parameter model by providing an example of finding a critical point with the learning of the Landau mean-field free energy being applied to the real data set from the uncorrelated random scale-free graph with a large degree exponent.

artificial intelligence, machine learning, neural network, (17 more...)

2010.00351

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.82)

Industry: Education (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Machine LearningOct-4-2018

Machine learning electron correlation in a disordered medium

Ma, Jianhua, Zhang, Puhan, Tan, Yaohua, Ghosh, Avik W., Chern, Gia-Wei

Learning from data has led to a paradigm shift in computational materials science. In particular, it has been shown that neural networks can learn the potential energy surface and interatomic forces through examples, thus bypassing the computationally expensive density functional theory calculations. Combining many-body techniques with a deep learning approach, we demonstrate that a fully-connected neural network is able to learn the complex collective behavior of electrons in strongly correlated systems. Specifically, we consider the Anderson-Hubbard (AH) model, which is a canonical system for studying the interplay between electron correlation and strong localization. The ground states of the AH model on a square lattice are obtained using the real-space Gutzwiller method. The obtained solutions are used to train a multi-task multi-layer neural network, which subsequently can accurately predict quantities such as the local probability of double occupation and the quasiparticle weight, given the disorder potential in the neighborhood as the input.

electron correlation, phy, rev, (12 more...)

doi: 10.1103/PhysRevB.99.085118

1810.02323

Country:

North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Italy > Sardinia (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Soloveychik, Ilya, Tarokh, Vahid

Region Detection in Markov Random Fields: Gaussian Case

arXiv.org Machine LearningMar-22-2018

In this work we consider the problem of model selection in Gaussian Markov fields in the sample deficient scenario. The benchmark information-theoretic results in the case of d-regular graphs require the number of samples to be at least proportional to the logarithm of the number of vertices to allow consistent graph recovery. When the number of samples is less than this amount, reliable detection of all edges is impossible. In many applications, it is more important to learn the distribution of the edge (coupling) parameters over the network than the specific locations of the edges. Assuming that the entire graph can be partitioned into a number of spatial regions with similar edge parameters and reasonably regular boundaries, we develop new information-theoretic sample complexity bounds and show that even bounded number of samples can be enough to consistently recover these regions. We also introduce and analyze an efficient region growing algorithm capable of recovering the regions with high accuracy. We show that it is consistent and demonstrate its performance benefits in synthetic simulations. Markov random fields, or undirected probabilistic graphical models, provide a structured representation of the joint distributions of families of random variables. A Markov random field is an association of a set of random variables with the vertices of a graph, where the missing edges describe conditional independence properties among the variables [1]. It was shown by Hammersley and Clifford in their unpublished work [1] that the joint probability distribution specified by such a model factorizes according to the underlying graph. The practical importance of Markov random field is hard to overestimate. They have been applied to a large number of fields, including bioinformatics, social science, control theory, civil engineering, political science, epidemiology, image processing, marketing analysis, and many others. For instance, a graphical model may be used to represent friendships between people in a social network [3] or links between organisms with the propensity to spread an infectious disease [28]. This work was supported by the Fulbright Foundation and Office of Navy Research grant N00014-17-1-2075. 2 Given the graph structure, the most common computational tasks include calculating marginals, maximum a posteriori assignments, the partition function, sampling from the distribution and other questions of statistical inference. On the other hand, in many applications estimating the unknown edge structure of the underlying graph, also known as model selection or inverse problem, has attracted a great deal of attention. Naturally, both problems are essentially challenging especially in high dimensional scenarios and are known to be NPhard for general models [2, 3]. A variety of methods have been proposed to address this problem.

artificial intelligence, graph, machine learning, (17 more...)

1802.03848

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
North America > United States > New York (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.85)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

arXiv.org Machine LearningMar-10-2015

L_1-regularized Boltzmann machine learning using majorizer minimization

Ohzeki, Masayuki

We propose an inference method to estimate sparse interactions and biases according to Boltzmann machine learning. The basis of this method is $L_1$ regularization, which is often used in compressed sensing, a technique for reconstructing sparse input signals from undersampled outputs. $L_1$ regularization impedes the simple application of the gradient method, which optimizes the cost function that leads to accurate estimations, owing to the cost function's lack of smoothness. In this study, we utilize the majorizer minimization method, which is a well-known technique implemented in optimization problems, to avoid the non-smoothness of the cost function. By using the majorizer minimization method, we elucidate essentially relevant biases and interactions from given data with seemingly strongly-correlated components.

artificial intelligence, interaction, machine learning, (15 more...)

doi: 10.7566/JPSJ.84.054801

1503.03132

Country: Asia > Japan (0.15)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.73)